Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling

نویسندگان

  • Junha Shin
  • Insuk Lee
  • Philip M. Kim
چکیده

Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life-Archaea, Bacteria, and Eukaryota-suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co-inheritance analysis within the domains of life will greatly potentiate the use of the expected onslaught of sequenced genomes in the study of molecular pathways in higher eukaryotes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ProtPhylo: identification of protein–phenotype and protein–protein functional associations via phylogenetic profiling

ProtPhylo is a web-based tool to identify proteins that are functionally linked to either a phenotype or a protein of interest based on co-evolution. ProtPhylo infers functional associations by comparing protein phylogenetic profiles (co-occurrence patterns of orthology relationships) for more than 9.7 million non-redundant protein sequences from all three domains of life. Users can query any o...

متن کامل

Sequence Analysis and Phylogenetic Profiling of the Nonstructural (NS) Genes of H9N2 Influenza A Viruses Isolated in Iran during 1998-2007

The earliest evidences on circulation of Avian Influenza (AI) virus on the Iranian poultry farms date back to 1998. Great economic losses through dramatic drop in egg production and high mortality rates are characteristically attributed to H9N2 AI virus. In the present work non-structural (NS) genes of 10 Iranian H9N2 chicken AI viruses collected during 1998-2007 were fully sequenced and subjec...

متن کامل

Genetic Co-Occurrence Network across Sequenced Microbes

The phenotype of any organism on earth is, in large part, the consequence of interplay between numerous gene products encoded in the genome, and such interplay between gene products affects the evolutionary fate of the genome itself through the resulting phenotype. In this regard, contemporary genomes can be used as molecular records that reveal associations of various genes working in their na...

متن کامل

Protein profiling for phylogenetic relationship in snakehead species

Protein banding pattern of eight snakeheads – Channa species viz., Channa striatus, Channa marulius, Channa punctatus, Channa diplogramme, Channa bleheri, Channa gachua, Channa stewartii and Channa aurantimaculata collected from different regions of India were used to study the phylogenetic relationship among them. The banding pattern from muscle protein indicated a unique profile for each spec...

متن کامل

Protein profiling for phylogenetic relationship in snakehead species

Protein banding pattern of eight snakeheads – Channa species viz., Channa striatus, Channa marulius, Channa punctatus, Channa diplogramme, Channa bleheri, Channa gachua, Channa stewartii and Channa aurantimaculata collected from different regions of India were used to study the phylogenetic relationship among them. The banding pattern from muscle protein indicated a unique profile for each spec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2015